(54) NEW PROMOTER DERIVED FROM GENUS ASPERGILLUS

(57)Abstract: 

PROBLEM TO BE SOLVED: To obtain a new promoter comprising DNA derived from the genus Aspergillus that has a specific base sequence and allows a mold of the genus Aspergillus, transformed with a vector comprising the DNA sequence, to produce a foreign protein.

SOLUTION: This new promoter is represented by a base sequence of, e.g., formula I or II and is derived from Aspergillus oryzae, which belongs to the genus Aspergillus. A foreign protein can be produced by a mold of the genus Aspergillus transformed with a vector comprising the promoter sequence. The promoter is obtained by extracting total RNA from Aspergillus oryzae grown in the presence of glucose, passing it through an oligo(dT) cellulose column to separate the mRNA, synthesizing cDNA from the mRNA, constructing a cDNA library by a conventional method, and cloning and screening the library for highly expressed genes. This yields the promoter of, e.g., an alcohol dehydrogenase gene that is strongly expressed in the presence of glucose.
(54) [Title of the Invention] Novel promoter derived from the genus Aspergillus

(57) [Abstract]

[Problem] Promoters of the following genes of the genus Aspergillus: the alcohol dehydrogenase I gene, the coproporphyrinogen III oxidase gene, the hexose transporter gene, the histone H4.1 gene, the 40S ribosomal protein S17 gene and the 40S ribosomal protein S28 gene.
3 ] T*^#^*«**Tfc5SI#« l Xtt 
l XfiH*^ 2 ISSccoy p^~^- 0 

t Koyt- I (alcohol dehydrogenase I) 
^ -< y /— ^^III^v'^— (copro 

porphyrinogen III oxidase) iftfis-F\ ^^V"^ • h 
7^7^-^-- (hexose transporter) aSfe^ h 
>H4.l (histon H4.D 40S !) tfV-i^6fCSl 

7 (40S ribosomal protein Sl7) jIS^ 40S U 
A^&®S28 (40S ribosomal protein S28) M&+<Di»l 

[000 1] 

[SSW©i*i-5ftWl T^^^r/V^ (Aspe 

rgillus) JR*ttBcojpr^n*-^-^|»-r5o JEfc 

[0 0 0 2] 

,w n - * a D*»»i- a ^mm^mmm * n c » * 

[0 0 0 3] r.*te>G>a*fc:j:9, 

[0 0 0 4] :©S,»d>^ *^B^*3»t53Si*4^n 



*-6*©^3r5 5- tfffle^cD^n*— [Bi 
otechnology, 5, 368 (1987) ] . hy^r/W- 3 — 

* — [Biotechnology, 7, 596 (1989) ] % T^/V^ 

- [Biotechnology, 6,1419 (1988) ] ($#HPg62-2729 
88) „ r^^*;^ • - K7^S*^7^3-;i/r 
t Ko^t-f iafe^/P*-^- [Biotechnolog 
y, 5, 713 (1987) ] . T X^/U&UX * 1* s 
s<;l>3?jV7> • (D^V^^T^y t K- 3-/1x^7 

ol-T h ft KP^t-f (GAP-DH) 0>:/d*— £ 
^ifflLTS«^61C<£>Ki£ («fM¥3- 187392) CO^ft 

[0 0 0 5] 
[0 0 0 6] 

[0 0 0 7] *«M^J3^Ttt. KT©J:5fcLr±E 

C<Z)/fyA#iRNA^t>TcDNA^ 
U 1S4^ ^-Srffl^t c DNA7>C 7 !/ - 

[0 0 0 9] lo^^^y-^Ki^i^o- 

^ft*fHkfc#it «*.tfsoo«B( G»8.oooae*-c 
«iS $ *lt v > 5 as -c «>ix tf ft 500(1 r*^ a t # 

X.?>tb5 0 ) L N ^^^^ KDNA*ttfflLT#I800bp 

[ooio] ^ 9 UT»6>ix*:«SeWtt, ftcDNA 
oa^KamSrS' ^BR^IXEHfct^ DNA 

^cDNAOig)!W^|TV\ 

[0 0 1 l] w^Lri^£tfc^-^# 
CR (Polymerase chain reaction) $rfflV^5^fe^J: 

[0012] As one example of the method using PCR, the DNA is digested with a restriction enzyme that recognizes 6 bp and produces blunt ends, such as EcoRV, ScaI, DraI, PvuII or SspI.

[00 13] rKDWM^T^y^-^iifeSlxfc^yA 
DNAifri^ili: LT\ cDNACDJgSIB^^/* 

«*1-CHIBU1^2 k b pgg(7)S^^ttSDN 

[0 0 14] »fioaStLfc^n-v^e>^9^^ K£ 

^JSrAWrU c DN A^^4fiSE^i?rtt c DN A 
O ±«E 5 DNA^^^OcDNA* 

[0 0 15] /n^-^-g^tt5DNA^9 
mmtb* >Q ±M l k b p SS^±£*1-3 

[ 0 o 1 6 J *^<o^n^~^-ti±57g<oj: p (d It 
fllX-tf, 0.1% SDS 0.3mol NaCh 0.03M ^ 

[0 0 17] ^Bo/n^-^-fc f^)Ti^, 0f 

So 

[0018] fi±iim r^/u^-x • *y-tfSr 
-^>y^js x hy 37 f /v^*ftir©t»±**ttffl'C 

■ ^-y-^*rffl^5wi:^gF^bv\ 

[0 0 19] '&btltc7 r V*-*--^<D&m*>'<?'g 



[0 0 2 0] fg±<Dftm&®: t a fl^*ff<0*ifc-efT 9 w 

[0 0 2 1] L4#fe*«WSrB» 
[0 0 2 2] 

r^^4f^-X • (Aspergillus oryzae) ATCC 4 

2149Sr£JLT<0»**f4=r«*Lfco YPDi§i& 
3r*l% v /Ky-<7 P h^2%^0 ! ^^ — ^2%«tf9ft 
-5ig^fe (pH6.0) {-#g#;£SffiU 3<flCi;:iS^T22B* 
U ■»ft*iiLT»fc. »feix*:a« 
*4gft*WB**-CJIlGBU ^OS*. «*S*ftt^ 

[0023] (2) Construction of the cDNA library

Total RNA was extracted by the method of Chirgwin et al. [Biochemistry, vol. 18, p. 5294 (1979)], giving 4.6 mg of total RNA. The RNA was applied to an oligo(dT)-cellulose column (Pharmacia), and 30 µg of poly(A) RNA was obtained from 1.4 mg of total RNA.

[0024] From this poly(A) RNA, cDNA was synthesized using an oligo(dT) primer containing a NotI site (5'-TTCTAGAATTCAGCGGCCGCTTTTTTTTTTTTTT…), reverse transcriptase (RNase H-), E. coli DNA polymerase I, E. coli DNA ligase and E. coli RNase H.

[0025] The cDNA was blunt-ended, and an EcoRI adaptor (5'-AATTCGGCACGAGG-3' and 5'-CCTCGTGCCG-3') was then ligated to it.

[0026] The adaptor-ligated cDNA was digested with NotI and inserted between the EcoRI and NotI sites of a vector to construct the cDNA library.

[0027] (3) Determination of base sequences

Five hundred clones were selected and plasmid DNA was prepared from them. About 800 bp of base sequence was determined for each clone using an M13 primer (5'-CACGACGTTGTAAAACGAC-3'), Taq DNA polymerase (Perkin-Elmer), a DNA thermal cycler (Perkin-Elmer) and a DNA sequencer (LI-COR Model 4000L).

[0028] The base sequences obtained in this way are those of the 5' end of each cDNA. They were searched against the GenBank DNA sequence database using BLASTX.



[0 0 2 9] Zbfc. Sequencher (GeneCodesfl) 
[0 0 3 0] 



Gene name                          Score   Source

Alcohol dehydrogenase I            11      Af
Coproporphyrinogen III oxidase     4       Sc
Hexose transporter                 5       Sc
Histone H4.1                       4       An
40S Ribosomal protein S17          2       Sc
40S Ribosomal protein S28          2       Sc



[0031] The entries in the table have the following meanings.

Score: number of occurrences among the 600 cDNA clones sequenced

Source: organism of the matching GenBank entry

Sc=Saccharomyces cerevisiae 
Af=Aspergillus flavus 
An=Aspergillus nidulans 

[0032] From these results, the alcohol dehydrogenase I gene, the coproporphyrinogen III oxidase gene, the hexose transporter gene, the histone H4.1 gene, the 40S ribosomal protein S17 gene and the 40S ribosomal protein S28 gene were identified as highly expressed genes.

[0033] Example 2: Cloning of the promoter of the alcohol dehydrogenase I gene

Genomic DNA was prepared from Aspergillus oryzae RIB-40 (ATCC 42149) by the method of Timberlake and Bernard. The genomic DNA was digested with EcoRV, a restriction enzyme that recognizes 6 bp and produces blunt ends, and ligated to an adaptor consisting of (P)5'-ACCTGCCC-3'(NH2) and 5'-CTAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGT-3'.
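[0034] Using this adaptor-ligated genomic DNA as the template, PCR was carried out with primer AD (5'-CTAATACGACTCACTATAGGGC-3') as the sense primer and primer #1 (5'-ACGAGGGGCATCTTCACTGGGAGG-3'), designed from the base sequence of the alcohol dehydrogenase I cDNA, as the antisense primer. The PCR used the Expand HF PCR system (Boehringer Mannheim) and a DNA thermal cycler.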
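
The adaptor strands and primer AD quoted in this example can be checked for internal consistency. The following is a minimal Python sketch and not part of the original procedure. It assumes the three sequences read exactly as printed, and the helper name `reverse_complement` is only illustrative. It tests that the short adaptor strand pairs with the 3' end of the long strand and that primer AD is identical to the 5' end of the long strand.

```python
# Adaptor strands and primer AD, copied from the text (assumed to be read correctly).
ADAPTOR_LONG = "CTAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGT"
ADAPTOR_SHORT = "ACCTGCCC"
PRIMER_AD = "CTAATACGACTCACTATAGGGC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# The short strand should base-pair with the 3' end of the long strand.
short_pairs_with_long_end = ADAPTOR_LONG.endswith(reverse_complement(ADAPTOR_SHORT))

# Primer AD should be identical to the 5' end of the long strand.
primer_matches_long_start = ADAPTOR_LONG.startswith(PRIMER_AD)

print(short_pairs_with_long_end, primer_matches_long_start)
```

Both checks hold for the sequences as printed, which is consistent with primer AD priming on the adaptor while primer #1 primes on the gene.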

[0 0 3 5] 

                                      Volume      (Final concentration)

H2O                                   20.25 µl
[10×] Reaction buffer                 2.5 µl      [1×]
dNTPs Mix, 10 mM                      0.5 µl      200 µM
Sense primer (primer AD)              0.25 µl     5 µM
Antisense primer (primer #1)          0.25 µl     5 µM
* template (DNA 0.2 µg)               1 µl
Expand HF DNA polymerase mix          0.25 µl     1.25 u/TEST

Total                                 25 µl





*EcoRV#«¥LfcDNA£0.2/ig/lOM 1 J: 5 E *>L*:t>0 
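
The reaction table can be checked with simple dilution arithmetic. The sketch below is illustrative only: the component names are shortened labels, and it assumes the right-hand column of the table gives final concentrations. It sums the listed volumes and applies C1·V1 = C2·V2 to the dNTP mix and the reaction buffer.

```python
# Component volumes in microlitres, as listed in the reaction table.
MIX_UL = {
    "H2O": 20.25,
    "10x reaction buffer": 2.5,
    "dNTP mix (10 mM)": 0.5,
    "sense primer (primer AD)": 0.25,
    "antisense primer (primer #1)": 0.25,
    "template DNA": 1.0,
    "Expand HF DNA polymerase mix": 0.25,
}


def final_concentration(stock: float, volume_ul: float, total_ul: float) -> float:
    """Dilution rule C1*V1 = C2*V2, solved for the final concentration C2."""
    return stock * volume_ul / total_ul


total_ul = sum(MIX_UL.values())

# 10 mM dNTP stock expressed in micromolar (10,000 uM).
dntp_final_um = final_concentration(10_000, MIX_UL["dNTP mix (10 mM)"], total_ul)

# Buffer strength expressed as a fold concentration (10x stock).
buffer_final_x = final_concentration(10, MIX_UL["10x reaction buffer"], total_ul)

print(total_ul, dntp_final_um, buffer_final_x)
```

The volumes add up to the stated 25 µl, and the computed dNTP and buffer values agree with the 200 µM and [1×] entries in the table.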

[0036] 25 µl of the above reaction mixture was placed in a 0.2 ml reaction tube and set in the DNA Thermal Cycler, and PCR was carried out under the following conditions.
[0037]

95°C, 1 min                                   1 cycle
95°C, 30 s; 74°C, 15 s; 70°C, 3 min           … cycles
95°C, 30 s; 70°C, 15 s; 70°C, 3 min           3 cycles
95°C, 30 s; 66°C, 15 s; 70°C, 3 min           … cycles
95°C, 30 s; …, 15 s; 70°C, 3 min              3 cycles
95°C, 30 s; 58°C, 15 s; 70°C, 3 min           3 cycles
95°C, 30 s; 54°C, 15 s; 70°C, 3 min           20 cycles
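
The legible annealing temperatures in this program (74, 70, 66, 58 and 54 °C) suggest a touchdown scheme with a constant 4 °C decrement. The sketch below generates such a schedule under that assumption. The uniform step, the 62 °C value it implies for the illegible row, and the step durations (30 s, 15 s, 3 min) are readings of a damaged scan rather than confirmed values, and the function names are hypothetical.

```python
def touchdown_annealing_temps(start_c: int, end_c: int, step_c: int) -> list[int]:
    """List annealing temperatures from start_c down to end_c in step_c decrements."""
    if step_c <= 0 or start_c < end_c:
        raise ValueError("need start_c >= end_c and a positive step")
    return list(range(start_c, end_c - 1, -step_c))


def build_program(temps, denature=(95, 30), anneal_s=15, extend=(70, 180)):
    """Return one [denature, anneal, extend] cycle of (deg C, seconds) per temperature."""
    return [[denature, (t, anneal_s), extend] for t in temps]


# Assumed uniform 4 degree C decrement between the first and last legible values.
temps = touchdown_annealing_temps(74, 54, 4)
for cycle in build_program(temps):
    print(cycle)
```

Starting above the expected primer melting temperature and stepping down favours specific priming in the early cycles, which is the usual reason for a program of this shape.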

[0038] m&mm&i.s<yoTtfv-x • y^ssc* 

B-e«8LT2.0k b^DNA^SrSt. ^^DNA 

[00 3 9] Wu 7 y^^77; KSrMKLX^ 

«n*WBEWat««>±«KHilR>IIJ*3 K:/£8±j* 
i829bp<£>D N Aft$£^$> r t Sr AS b/c 0 
[0 0 4 0] CCOgl^Lr, 1829bpC0T^3— • ft 

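[0041] Example 3: Cloning of the promoter of the coproporphyrinogen III oxidase gene

DraI was used as the restriction enzyme that recognizes 6 bp and produces blunt ends for digesting the genomic DNA.

[0042] As the antisense primer, primer #2 (5'-GATCGCTTTTCCGCTGAGTATCTG-3'), designed from the base sequence of the coproporphyrinogen III oxidase cDNA, was used. The base sequence of the 1887 bp promoter of the coproporphyrinogen III oxidase gene obtained in this way is shown in SEQ ID NO: 2.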
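[0043] Example 4: Cloning of the promoter of the hexose transporter gene

The genomic DNA was digested with a restriction enzyme that recognizes 6 bp …

[0044] As the antisense primer, primer #3 (5'-CTGGGGTAGGAGTATCGGAGTGCT-3'), designed from the base sequence of the hexose transporter cDNA, was used. The base sequence of the 2091 bp promoter of the hexose transporter gene obtained in this way is shown in SEQ ID NO: 3.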

[0045] Example 5: Promoter of the histone H4.1 gene

SspI was used as the restriction enzyme that recognizes 6 bp and produces blunt ends for digesting the genomic DNA.

[0046] As the antisense primer, primer #4 (5'-AGGTTGAGTGGGAAAATGTGGAGA-3'), designed from the base sequence of the histone H4.1 cDNA, was used. The 1915 bp promoter of the histone H4.1 gene was obtained in this way.

[0047] Example 6: Promoter of the 40S ribosomal protein S17 gene

[0048] As the antisense primer, primer #5 (5'-GCTTGGGGTAGTAACGCTCAATGA-3'), designed from the base sequence of the 40S ribosomal protein S17 cDNA, was used. The base sequence of the 1569 bp promoter of the 40S ribosomal protein S17 gene obtained in this way is shown in SEQ ID NO: 5.

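[0049] Example 7: Promoter of the 40S ribosomal protein S28 gene

SspI was used as the restriction enzyme that recognizes 6 bp and produces blunt ends for digesting the genomic DNA.

[0050] As the antisense primer, primer #6 (5'-AAGCACCACCGAAGGGAGAGGACT-3'), designed from the base sequence of the 40S ribosomal protein S28 cDNA, was used.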

[0 0 5 1] 7/^-/^7tKP^t-f 

-for^^^u^ • ^-y-tf-eco^^ 

(1) «I6*^-^n-y-f|gfIS^ty 

Using the 2.0 kb DNA fragment obtained in Example 2 as the template, a 1.0 kb DNA fragment containing the alcohol dehydrogenase I gene promoter was obtained by PCR. The primers used for the PCR were as follows.

Sense primer: 5'-GTCGACGAACTATTACGGAGTACATAGC-3'

Antisense primer: 5'-GTCGACTTTGACTTTGGGATCTTGTGTT-3'
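
The two primers each begin with a SalI site (GTCGAC), and their gene-specific parts can be located on SEQ ID NO: 1 to estimate the product size. The sketch below is illustrative only: it uses three rows of SEQ ID NO: 1 as printed in the sequence listing (those ending at positions 840, 900 and 1829), and the function names are hypothetical. The printed sense primer and the printed sequence differ at one base, which may be a scanning artifact, so the search tolerates mismatches and reports the best-matching site.

```python
SALI_SITE = "GTCGAC"
SENSE = "GTCGACGAACTATTACGGAGTACATAGC"
ANTISENSE = "GTCGACTTTGACTTTGGGATCTTGTGTT"

# Rows of SEQ ID NO: 1 as printed, keyed by the position of their first base.
ROWS = {
    781: "GGAAGGGAGGGGGTTTAACGGAGTAGATGGGGCGAAGTGGCTGCCATGCTGAACTATTAC",
    841: "GCAGTACATAGCGAGAAGTACGAACATCCACCAACACACAAACCGAACTACCGCCTCACA",
    1801: "TTCTTTCAACACAAGATCCCAAAGTCAAA",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def best_site(primer: str, region: str, region_start: int) -> tuple[int, int]:
    """Return (1-based start, mismatches) of the window in region closest to primer."""
    best = None
    for i in range(len(region) - len(primer) + 1):
        window = region[i:i + len(primer)]
        mismatches = sum(a != b for a, b in zip(primer, window))
        if best is None or mismatches < best[1]:
            best = (region_start + i, mismatches)
    return best


# Gene-specific parts of the primers, with the 5' SalI tails removed.
sense_core = SENSE[len(SALI_SITE):]
anti_core = reverse_complement(ANTISENSE[len(SALI_SITE):])

sense_start, sense_mm = best_site(sense_core, ROWS[781] + ROWS[841], 781)
anti_start, anti_mm = best_site(anti_core, ROWS[1801], 1801)
anti_end = anti_start + len(anti_core) - 1

genomic_span = anti_end - sense_start + 1
product_len = genomic_span + 2 * len(SALI_SITE)
print(sense_start, sense_mm, anti_end, anti_mm, product_len)
```

Under these assumptions the antisense primer anneals at the very end of SEQ ID NO: 1 and the predicted product is close to 1.0 kb, in line with the fragment size stated in the text.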

[0052] This fragment was digested with SalI and inserted into the SalI site of plasmid pBRJ275 (4.5 kb, Clontech), and a plasmid carrying the fragment in the correct orientation, pBRJAP (5.5 kb), was obtained.

[0053] From a 2.9-kb BglII fragment (Biosci. Biotech. Biochem., 60, 161-163, 1996) containing the … gene, a 0.6-kb EcoRV-HindIII fragment containing the terminator was excised and inserted into the EcoRV-HindIII site of pBluescript (Stratagene) to give plasmid pBET (3.6 kb).

[0054] A 2.8-kb PstI-EcoRI fragment containing the alcohol dehydrogenase I gene promoter was excised from plasmid pBRJAP and inserted into the PstI-EcoRI site of plasmid pBET to give plasmid pBAPETG (6.4 kb).

[0055] (2) -krnm&Jkz-pw*-'?— 

For introduction into Aspergillus oryzae, plasmid pBAPETG was digested with PstI and HindIII, and the resulting 3.4-kb fragment containing the expression cassette was inserted into the PstI-HindIII site of plasmid pN3 (JP-A H6-245777), which carries the niaD gene as a transformation marker for Aspergillus oryzae, to give plasmid pNAPETG (11.6 kb).
[0056] (3) Transformation

Plasmid pNAPETG was introduced into an Aspergillus oryzae niaD mutant strain by the method of Unkles et al. (Mol. Gen. Genet., 218, 99-104, 1989). The transformants obtained were grown on Czapek-Dox medium containing 2% glucose and 50 µg/ml 5-bromo-4-chloro-3-indolyl-β-D-glucuronide.



[0057] The transformant was cultured in Czapek-Dox medium, and the β-glucuronidase activity of the cell extract was measured by the method of Jefferson et al. (Proc. Natl. Acad. Sci. U.S.A., 83, 8447-8451, 1986). The specific activity was 78 nmol p-nitrophenol/min/mg protein. In contrast, that of the strain transformed with pN3 was <1 nmol p-nitrophenol/min/mg protein.

[0 0 5 8] 
[0 0 5 9] 

[Sequence Listing]

SEQ ID NO: 1
Sequence length: 1829

Sequence type:

Molecule type: Genomic DNA
Feature key: promoter



ATCAATTTAA AGGCTATAGA TGGCTAAGAC TTTGGGGGCG ATACAGATCG GAGTGTTGGA 60 

AAGGGGCTTG CTTTCGTTTG ACAGCATAAG CTATCGTGGT GTTACAGCGG TAGCAAACAC 120 

GTGGAACAGG AATCCTTATA TTGTCTAGAT CATAGACATG TCAAACATTT CAGACAATCC 180 

CCTTCACTAG CAGACTACAC ACCCAAATGC TATCAATGAA GAAACGGACA GAATTATATG 240 

TACTTCTACA GAGGGTGGAG GTCAGGTGGC TTCCGAAAGG TCCTTCGCCC GTAAGGTACT 300 

CCTGCTAGGC TTTGAAAGTT TTAAAAACCC CGCTATCTGC CCTCAAAAAC TCTTTTGTCT 360 

TTCTCAATTG ACAGGAGGGT CTGCGCGGAG GATGTTATTG TCTCCGTATC CATGAGTGCG 420 

ATGAGAGAAG ACAGTAATGA TGACGACGAG AACAGAATCA CAAACAAGGT GATCCTACAG 480 

ATGTTACTGA AGACACCATC TGCTCTGGGT CGTAAAATTG TCGCTGACTT TCAGGGAGAA 540 

GTTCAAGTCT TTCTTTTATA GCAGGAACTG AACCCAATCA CCGACCTCAT GTTCCATACG 600 

GCTCGCTGGT TAGTCGTCCG GATATGGGGA AGCTGCCGCT GTTAGGTATC ACGATTATAC 660 

ATTAATTTGC GGTGCCTTCC ATATGTTGAG CTCCGTACGT CCGGGGTCAG CATGGGAAGA 720 

ATTAGATGAA AGCGCAGCCA TCCCGCATCT CAGCAAATCC CGAGAAAAAG CAAGCAAGAA 780 

GGAAGGGAGG GGGTTTAACG GAGTAGATGG GGCGAAGTGG CTGCCATGCT GAACTATTAC 840 

GCAGTACATA GCGAGAAGTA CGAACATCCA CCAACACACA AACCGAACTA CCGCCTCACA 900 
ATTCCGCCTG AGGCATCCAT GCCATCCTTA AAACTGACTA GGGTTGCCAC TTCCATTGCC 960 
CATCAGTCGA TCGAAATCAT CTGATTTGAA GGGTCTCGGT TGAAACGACA CTACAGTATC 1020 
GAACGGGATT TGCTTCTGGA GTCTCCCAGC TTCAGCTCGA GTCTATCTTA TTCGGTGGTC 1080 
TATCCCGAAA TTAAAAGCTT GACGTTTGAC AGGGAACCTT TAAAAAAAAA GAAAAGAAGA 1140 
TCCTTTCTTC TTTCCCCGCA TGAGATACGG CCAATTTTGC TATATTTGGA AAGGCTTGGC 1200 
CCAGGATCTG GACTCACTTG ACAGGGAAGC CTTTTCGGGG GAAGGGAAAC TCTGTGACTC 1260 
TGTACGGGAG TACTACTTCC TACTTACCAC TTACTACTTA CTACTGACTT GTTAGTTTAA 1320 
AATCGAAGGA AGTGCTTATG CGCCATCGTA CCGATCGGGC AATGAGCCTA ACCCCGATGA 1380 
CGTCGGCCCA TGGATCTCCT GTTGTTGCCG TTCATTCGGT GGGCTGATAA TCTTCCAATG 1440 
ATCCAAAAAT CGTTTGATCG TCTTGGAATA GCTTCGGTGT GTCTCACAGA TTCTCTTTCA 1500 
TGGTTTTTTG ACCTTTCCTT TCCCTCTCTC CGGTGACTCG TTTTCCATAT GTCCGTCTAG 1560 
ATGTGCGTGT GTATCACTAC AACACCATTC ACCTTCATTC ACCACTATTA TTCCCACCCT 1620 
ATAATACGAC GTCATGCCTA TGAGATTCCC GGGTTGGATG ACCGTCACCC CACCCATTGC 1680 
TTGGATTCTA GACTTCTATA AGAACTGTCC AGCGTCTCAC CATCAGCTTA CTTCACTTTC 1740 
TCATCACTTC TCTCATCGTG TAAATCAACC CTAACTCCAC CGCAACCTCC TTTCTTCCCT 1800 
TTCTTTCAAC ACAAGATCCC AAAGTCAAA 1829 
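
Because the listing prints a running position at the end of each row, scanning damage in a row can be detected by recounting its bases. The following is a minimal sketch of such a check and is not part of the patent. The function names are hypothetical, and it assumes each row consists of space-separated base blocks followed by the position of the row's last base. IUPAC ambiguity codes are accepted because the listing contains symbols such as M, Y, K, S, W and N.

```python
IUPAC = set("ACGTRYKMSWBDHVN")


def parse_row(line: str) -> tuple[str, int]:
    """Split a listing row such as 'ATCAATTTAA ... 60' into (bases, end position)."""
    *blocks, end = line.split()
    bases = "".join(blocks).upper()
    bad = set(bases) - IUPAC
    if bad:
        raise ValueError(f"non-IUPAC characters: {sorted(bad)}")
    return bases, int(end)


def check_rows(lines, start: int = 1) -> bool:
    """Verify each row's printed end position against a running base count."""
    pos = start - 1
    for line in lines:
        bases, end = parse_row(line)
        pos += len(bases)
        if pos != end:
            return False
    return True


first_row = ("ATCAATTTAA AGGCTATAGA TGGCTAAGAC TTTGGGGGCG "
             "ATACAGATCG GAGTGTTGGA 60")
last_row = "TTCTTTCAAC ACAAGATCCC AAAGTCAAA 1829"

print(check_rows([first_row]), check_rows([last_row], start=1801))
```

A row that has lost blocks in scanning fails the check, because its base count no longer reaches the printed position.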



TAAAGGGAAT ACGGAAAACC TGATATAAAC GCCTGCACAC ATGAAGAATC GCCGCAAGTC 60 

TAACCAGCTC ATTCGCCTCA TACCGGAGTG GCTGGTCACA CACCCGGCCC TAAAACAAAA 120 

CACATACTGA CGGCGAMGAG TTACATATCT TGGGAGAACT GCTTATCAGA GCTTAAAGAT 180 

CGCCGAAGTG TTTTCCCCCG AATACCTAGA GACTATGTCA ATATGTGCCG AAAACCGCCT 240 

TTGAAGAATA AAGCATACGT GATATAAAGA AAGCCATCTT CATCCTTGTG TTCTTCGTAG 300 

ATGCTGCTCA TAAGTGCAGC TGTTGGTGGT AGCACCTCGT CGACGAAGAT GAAGATAGCC 360 

TTCTCGGGAG ACAGTTTGAT GCGCTTGCGG ATAACATACA CGAACTGCCC GACGGTAAGG 420 

TCTGCAGGAA CAAGATACTT CTTCTTATCA ATAGTGGCGA TGTCCGACTT CTCGACTTTC 480 

TCACAGATTA CCTACAAGTG AAACTCGAGT TAGCATTCTA AACAAGCGGT GAGTAGACAT 540 

CTTCCTTCCC TGAGGAAGAA CAGCTCAAGG GCAAGAATCA TCTACCAAAG TACGGATCAG 600 

ATAGAGTTGT GCGTACTGGA ATGCGATCAG CGTATTTCTG ACGGATACGC TCGGCTTCTG 660 

CCTTGCGCTT CTCAAAGGGG TGCTCGTCCT TGAACTTGGA GCGCATATTG ATGGATCGAA 720 

TCAGTTAATG GATACAACTA GGTCGGTTTA CTATGAGTAC TAGGTACATC AACGAGATAG 780 

ACGTAGATGG TGTGAGGTGC AGAGTGATGT AAATGAGATG GTTGGTGTAA TAGGTGAACA 840 

GTAGGAAGAA GGTGGATGAT GACAAAAGGG AGAGCAAAGC CCCGCGTCAT GGGTAGTACC 900 

ACGGCTGATA AGGTCGTCAG GCAAGAGCGG ATCACGGGGT GGAGGCGGTG AGCAGCCCCA 960 

CACGTGAGTC ACGATCAGCC TTGAGCCTTC AGGTGCCCCA AGTCATCCAT TTAATTTGAT 1020 

TTGATTTATG TCATCTTCAT TAAACAGATA CCTCATGGAC ACACGTCATA TTCTCACGTG 1080 

GGGAGCCTGG GTGATGTCCC GGTGATATCG GGTCTATAGT GCCCTACAAA GATGTACTAC 1140 

TGAACTGAGT AGCTGTGGGT AGATGCGGTG TCCGCCTCCA CCTGATCAGC GGCTAGACTA 1200 

CTGGTGCGCG CTGTCTCCAT AGAGGGCTCT TGATAGGCTC TGTTCCTGGA ACTTGTAGTC 1260 

GGTATTACAA TCATAGTCAA GTTCCAAGAG TATATATAAT CCCAATTCTG TGATGACTCT 1320 

CACTCAACTT AATCTCATTA CAAGCTCCAT TATGACAAGC CAACAACTTT ACATGCATGA 1380 

TCAACTCGAT GGCCAATCCT CAACGCAATG CAAACCAGGC AAACTACTTA TCACAGCGTC 1440 

ATCTGACTAA GCAGGAACTG GGACATGTGC CAAACAATGC AAACAGCGTA TATCCCGGAT 1500 

ATTCTGACGA CGAGTGGTCC CATACAGTTC CAGTCCAGAG AAGGTAACCT CACGATTAGG 1560 

CTACCTCTAT TTCTTGAGAT AGGTGGAGCC GTTTGAGAGA GCGTGCACCC ACTCATATCC 1620 

TTCCGGTACC ACCTGATCAT TCCATCTATC AGCCAATTTT CAATTCCATC TTATAAAACT 1680 



[0060] SEQ ID NO: 2
Sequence length: 1887

Molecule type: Genomic DNA
Origin

Organism: Aspergillus oryzae
ATGACCTTGT CCCGGCAATA TATGCCTCTT CGGAGGTCAC TCTGCCACTT CTCAGCGCAG 1740 

ACCAGACGCG CACCTTGCAA CTTTCAGAAT ACATTTCGCA GATACTCAGC GGAAAAGCGA 1800 

TCCAGCGAAC CGAAATCTTT TGTCGTGTGG AGGCCATACC TCCGGCTGGC CATTGGTGTC 1860 

CCATTCATCG GAGCTCTGAT ATATTCC 1887 
[0061] SEQ ID NO: 3    Molecule type: Genomic DNA

m<o& : em©** 

Topology: linear    Feature key: promoter



AWTTGCGGTG 60
GCAACAATGC 120
AAGGAAAAAG 180
TACAAACACA 240
GACCACGAAG 300
TCTGCATGCA 360
GCCGAKATAT 420
CTGAGAGATT 480
GAAATCTCAG 540
AGTCTGCACC 600
CTTGATGCGA 660
GAASTATTKC 720
ACGNTGCCGA 780
AGCAGAAGAC 840
CCCAATATTG 900
TTGGCACAGA 960
TACCCGACTC 1020
ATGACTCTTG 1080
AATTAAATTT 1140
AACATCGTTA 1200
TAAACAATTG 1260
CCTGAGAAGT 1320
CTACCACCGT 1380
TAATCGAGAT 1440
TAAAATAAAT 1500
TCCCCGTGAC 1560
GACGGCGAGC 1620
GTCGTCTCCA 1680
AGAACACACT 1740
GGCCCCCGTT 1800
GATGGAGGCG 1860
ATTTTCAATT 1920
TATATATCTC 1980
TTCATCCCTT 2040
T 2091



[0062] SEQ ID NO: 4    Molecule type: Genomic DNA

Sequence length: 1915

Topology: linear    Feature key: promoter
ATTCTATAGA CTTGGTAGAA GTAGAACCTT CCCTGGTCGT ATATGGAGTA CCGAACGCGG 60 
TGGGCAGTAC ACCAGCAACA CCACCAGCGT ACACACGCTC AACAGTTCGC GTCGCTAAAG 120 
GCGCGCGACC GCGATAGTGG CATACTTCTT ACAAGAGAAA ACAGCGCGAG ATTAAGTCAC 180 
GACAGTAATG AATTGGATTA TTCGTCATAG AATCTGCATT TCGAATCTAT GCCATCCGGT 240 
GGCAATTCAT AGAGAATACC ACTTGGTCAA GTAAAATTTC TTAATGTACA TGCTCGGATG 300 
CATCCCATGA GATCCAATCC ACTCAGCCGC GTCGTCGGCA GCCTACTATT TATTGCGCAG 360 
CAAAACGAAA ACTAATCCCA GTGCCGAAAC CCATTGATAT TCGCAAATGA AAAAAAAGAA 420 
CCCATCATGT ATCGCCAGTC GTGAAAAAGA AATAAAACCC CTAATCGCCT ACCCAGAGAC 480 
AACCAGAAAA ACCGGACTCA TTAGAGAATA ATCTAAGAGC GCTCACCACG GAGACGGCGG 540 
GCGAGCTGGA TGTCCTTGGA CTGGATGGTA ACACGCTTGG CGTGGATGGC GCAGAGGTTG 600 
GTGTCCTCGA AAAGAGAGAC GAGGTACGCC TCAACGGACT CCTGGAGAGC ACCGATGGCG 660 
GAGGACTGGA AACGGAGGTC GGACTTGAAA TCCTGGGCGA TTTCACGCAC CAGACGCTGG 720 
AAGGGGAGCT TGCGGATCAG AAGCTCGGTG GACTTCTGGT AGCGACGCAT CTCACGGAGA 780 
GCGACGGTAC CTTTTCATGA CTGTTAGTGA TGACGCGATG ATGGTTTGCC ACATCATAAG 840 
ATAACTTACC AGGCTTGTAA CGGTGAGGCT TCTTGACACC TCCGGTAGAG GGAGCAGCCT 900 
TACCGGCGGC CTTGGAGGCG AGCTGCTTAC GAGGGGCCTT GCCACCAGTG GACTTACCTG 960 
GATGGATGTT AGTGATGATG ATTCAAGACG CGATGTATCA CTCATGGGGA CGCGACCCTG 1020 
ACGCGGGGCG ACGACACCAC AAGACAAAGG ATAGTAAGGT TAAGACTTAC GGGCACTCTG 1080 
CTTAGTTCTG GCCATCTTTA GAGATAAGGT TTGATGGATT TAGTCGACTT TGAGATGAAA 1140 
ATAGATAAAA GTCGATGGTT GTCGATGATA ATGTGTTGGC GGATGAGGAG ATGAAAGAAG 1200 
GATGGAGGAG GCGGGTGGGG GAAGGTATTT GTAGTAAGCG AAGGGTGGGT GGTGATCTGA 1260 
TGGCGTTGCC AGCAATTTGG GTGCCCTTAG CGTCGTTTCC CTTTGATCAA AAAAGCGCTC 1320 
GGAAAGCAGC ACACCATAAG CCGTGATTTC GGGGGTCACT TTTGGCTGCA GAATCAAAGG 1380 
GTTACCTTGC GGGCGGTCAA GGAATCCGCG CCCAAGGCAG GAATGAAGTG TGGTCTGTGC 1440 
GTTAAGCGAG TGACGGACCA ATAGAATCGT AGGATCACTG ATCTAGCCTG TCTCCCTACC 1500 
ACGTCTTGAT CCCAACCACC GGTAAGGTAT GTGAAGTAAT TCAGTAGATC AAAATGAAAG 1560 
TTGGTATGCT CAGGAATATC TCTGGTCAAT TTTGATCCCT GATCTCGGTG GTTCAGAGGC 1620 
ATTCCGTGAG GATCAAAGTT GATCGTAGCG GAATCATCAC TTCTCGGTCA CGTTGCCGAC 1680 
AGATCAAACC CGCCGGGAGG ATCCTCGTCT CTCATTGGTC GATCCTCTTG TAGCGCGTGA 1740 
TTCCAACGCC CTCCCCGCGC TCTTCCTTTT CCTATACAAA TATCACCCGT CCCATGGCAG 1800 
CTGTCCCAAT CTTTCCCTTC CCATCTCTCT CACATCTGTC ATCCAATTAA CCACCTATCT 1860 
CTCCACATTT TCCCACTCAA CCTTAAACAA GTTAACTCTT TAATCAACCA TCAAT 1915 



ATCGGAGGTC TTGGGCTCAA TGAAGTGAAT ATATAGTGGA GGGCTCTCCG CACTACTTTG 60 

TTGAAAGAYA ATCGATACTT GCTCGATTTA GTAGTCAAAG CGCCGAACTG AGCCTTTCCA 120 

TGGCAATAGT CAAATGAAAT AACTACTGGA CTAGATCTCA TCATCAAACC ATTGTATGTT 180 

TAGCTCACGG GGATGTTGTT CCTCGGTTAC TACCCGCTTT GTAGAGTCAT GGAGCAGAAT 240 

TCCGAAATGA GTTATAGTGG GCAAAAAACA TTATTCTTAT GTTGCCAGAA AAAAATACTA 300 

CGTTTTTCTC CTTCAACCAT TGTCTTGCGA GTCAACTAGC AGCATATCAG ACTGTGACAG 360 

TAGTCCGTTG CCCTCACCGG CCTATCGGAC GAGGTGTTTT TGTGTTGCTA CCCTTCTGGG 420 

TTTAGTGAGT TGGACCAGAT TTTGGTAAAT GTTGCACGTG ACTCGGGTCA CATGAGGAGC 480 

TCTTCAAAGA GTGCGCTATT CCTGAACAGG TACTCTTGAC GCTAGGCAAT TTCAGGCACA 540 

ACTCAAGGCG CCCGCAAGTG CCCGCTCAGT GGCCTACTCG AAAATACCAA CTCTTTGGGT 600 

GTGGTTCGCT ATAAGGGCTT TCGCCTTTAC ATAGTGACAT GAATAACCAC GGAGTATATA 660 



Sequence length: 1569
Origin

Feature key: promoter
GGTCCTTGGC TAATCGTGCG AAATCCAAGT ATCGATTCAA CTGCATGATT GTGATAGCTT 720 
GGTAGAGTAG CATAGCGGCA AGAGCAATCT CCGAGATGGC TAATATCAAG AATTTTACAA 780 
GATACTCGGA CATAAGTCTG CAGAGGTGAC TTCTCAAAGT TCTTTCCTGG AGGCTGTTCA 840 
ACGCGATGCC ATTCTCTTGA CCCCACATTG ATTGAATACG TCCCCCACAG CTTCACTGGA 900 
ACATGTTGGC TTGTGCCAAG TGAGAGCAAT CGTACCTCAT CTCATCATAT ATGGATATAT 960 
GTCGTATACC TGCTTCCGAA GGTTGAAAGT CAAGGATGGA AAAGAAAAAG GAGGAGAGTT 1020 
CTAGATTCAT GCGGGGGAGT GGTTGTGGTG TTACTAACTT ACTCCGTACT GATCCTACGA 1080 
CTGCCAGCTG ATATTCATAT GAATATCGAA TGATGATTGG GTGGATGGGG TTTCTTCCGA 1140 
AAGAGGGTTA GTTCGTTGTG GAAGGGTAGA TTTTACGATT CCAAGATAAT GTGGGATAGG 1200 
GCTAACGCGG CCTGCCTGAA CCCTCAGGCC GGAGCCGGGT GTTACATACT CACCGCCCAG 1260 
CGGCTGAGTT AATGAGAACA TACACATGAT TCGAAGTTTC AGGTCGATGG TGGAGGGGAT 1320 
ACATTCAAAA TTAGTACTTA AGAGAATTAA ATACGAGGCT AATGAGGTTA ACTGCTCTAC 1380 
TTACCCCACG AGAGGAGGAA AAACTGCACC ATCATCGTTT GGTGGTGTTG GTCATGTGAT 1440 
CTTCAATCAC CTGACTCTGA ACCCGTTAAG CGAGACTTGG CAGAAATCGG AATGCACACC 1500 
AAAATTGCCC ATCCTCACAA CCACCCGAAC AACATCGCAG CTTTCGAGGT CGACAACTAA 1560 
TACGCCAAA 1569 



CAGGTATTTC CTTTACTCGG TCCAGCC™ CCATTTCCCT CCATTGAATG ACMCACTGT 60 

CGCATTTTTT ATACGCCAAT AATAGCCATG GCCATTCTCT CACAACATCA CAGGTCAAGC 120 

AGACTGCAAG TGGTTTCTTG TGGTGGCTGT ACTATCAAAA CGATAGAATT CCCATTTGCT 180 

TCGTTCGCGG TAGTAACCCC CACCAATGCT TTGTGCACTT GATCAAGTGT ACACGTCCAC 240 

TGCCTGTCAA AATGGCACGT AATACATTCC AATCCACAGG CTAGAAGGCT TGTGCGGTGT 300 

CCACAGACGA TCAATCCTCA TAGGTTTCTG ACAGTCTCAC CCAGTGTTAG CCGTCGATGC 360 

GGTCTCCGTG GTTATGCTTT CATTATCACT CCATATGTCT CTGCTCTCCC TCTCCACTGC 420 

GCGCATAGCT TGAGCAACCT CCTCAAGCTT AGCTACCACA TCCAATACGC CGTTGTTGAC 480 

CCTATCGAAA TAGCGATCAT TCGCTTCACG CCTCTCGCGT GCCAGAAGAA TGTCCTTTTC 540 

TGTTTCCTCA TCTAGATTCC TCTGGGAGGC ATCATCCGTT GCGGTTTGTT CTCCCGTGTG 600 

ACTAGTGGCA GGAGAGGATG CTCTTGGTCT TGATTGACTC TTGGTAGCTG CATTCTTCGG 660 

ATCAACTACC TCATCAGCTT CGCTTCGAGC CCAGGCCTGG GCAAGCCTCA GCTGCTTTGT 720 

TCGCAGGTCT TCAAGGGCCT CAGAGTGCTT TTCGCCAAAG ACATGTTCAC TAGCTACCGC 780 

ACTCAGCTCG ACCTCTGACA AGGCATTCTG CATTTCTTGC CAGATGGCGT CCGAGCGATC 840 

AAGTTCGCCT TTTGATTGTG TACGAGATAG AAAGGGGGAG AGAGAGCCTG GGGTCTCAAG 900 

GCCCCCTTCC TGGCGCGAGG AACGATTCCG CTGCAGCGCA GGACGCTGAA CCCCTCCACC 960 

GGGAGAAACC GAGGGCACCG GAGGTGTGAG AGATATGTTA GATCCATTAT GGGAAGTGGG 1020 

TGGACCTTGA AAAAGAACAG GGGGAGGGAG ACGCGCAGAG GGAATTTTCC TCAGATTCTC 1080 

TAATTTTGGA GCCGCATCCA TGGTGTTCGC GAAGTATACG TTGTTGGCAG GCGAACTAAA 1140 

ACAGAACAAG CTCCAATTGA CAGCCGTAAA TAAAGGGAGC TTAATCTCCG CATACAATCA 1200 

ATGCTGACCG TCACTGATAA GGGGCTTATC GATATCGGAG AACGGCGGAA GTTGTTCCGA 1260 

TCCCGTGCCT CGCTTAGCGG CCGATTAGGG CGGAGCGCGC ACAGCAGAAC ATTTTGGTGA 1320 

GCCTTCAGCG ACACCGCCGG CCGTGAAAGG AACCCTTCGA CTCTCGACAA CCATCCCCAT 1380 

CAGGCCTTTC GGCAATCTAT ATCAGTCGGT ATGTATATTG TTGCTGCTAT TTTTTGATGA 1440 

AATATATTCG TGTCTAGGGA CTCCCGnCA ACATCGAATC TTTTGATTTT TGATTTTCCA 1500 

CCGCCCCGGC CATGTACTTC TGAAATCCAC GTCCCCTCCA CAAATCACGA AATCACCAAG 1560 

CATCAGATTG CCACAGCAAG ATCTAACGCA TTCTTCTTCT ACAGACCGTC CTTCGAACAT 1620 

AATCCGCCAA G 1631 